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SEQUENCE LISTING 

C--> 3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Bandman, Olga 

6 Guegler, Karl J. 

7 Lai, Preeti 

C--> 9 (ii) TITLE OF INVENTION: SH 3 -CONTAINING PROTEINS 

12 (iii) NUMBER OF SEQUENCES: 5 

14 (iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

16 (B) STREET: 3174 Porter Dr. 

17 (C) CITY: Palo Alto 

18 (D) STATE: CA 

19 (E) COUNTRY: USA 

20 (F) ZIP: 94304 

22 (v) COMPUTER READABLE FORM: 

23 (A) MEDIUM TYPE: Diskette 

24 (B) COMPUTER: IBM Compatible 
2 5 (C) OPERATING SYSTEM: DOS 

26 (D) SOFTWARE: FastSEQ for Windows Version 2.0 

28 (vi) CURRENT APPLICATION DATA: 

C--> 29 (A) APPLICATION NUMBER: US/09/925 , 122 

C--> 30 (B) FILING DATE: 08-Aug-2001 

32 (vii) PRIOR APPLICATION DATA: 

33 (A) APPLICATION NUMBER: 09/294,545 

34 (B) FILING DATE: 1999-04-19 

36 (viii) ATTORNEY/AGENT INFORMATION: 

37 (A) NAME: Billings, Lucy J. 

38 (B) REGISTRATION NUMBER: 36,749 

39 (C) REFERENCE/DOCKET NUMBER: PF-0419 US 

41 (ix) TELECOMMUNICATION INFORMATION: 

42 (A) TELEPHONE: 650-855-0555 

43 (B) TELEFAX: 650-845-4166 
4 5 (2) INFORMATION FOR SEQ ID NO : 1: 

4 7 (i) SEQUENCE CHARACTERISTICS: 

4 8 (A) LENGTH: 265 amino acids 

4 9 (B) TYPE: amino acid 

50 (C) STRANDEDNESS : single 

51 (D) TOPOLOGY: linear 

53 (vii) IMMEDIATE SOURCE: 

54 (A) LIBRARY: BRAITUT03 

55 (B) CLONE: 865744 

57 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

59 Met Lys Arg Thr Val Ser Asp Asn Ser Leu Ser Asn Ser Arg Gly Glu 

60 1 5 10 15 

61 Gly Lys Pro Asp Leu Lys Phe Gly Gly Lys Ser Lys Gly Lys Leu Trp 

62 20 25 30 

6 3 Pro Phe lie Lys Lys Asn Lys Gly Ala Thr Pro Glu Asp Phe Ser Asn 



NTERED 
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01 



Input Set : 
Output Set: 
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66 
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60 










67 
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Glu 


He 
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Glu 


Met 
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Arg 
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Ala 


He 
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68 


65 










70 










75 










80 


69 
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Met 
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Asp 


Val 
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Leu 


Lys 
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Pro 
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Met 
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Asp 


Pro 
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70 










85 










90 










95 




71 


Ser 


Leu 


Asp 


His 


Lys 


Leu 


Ala 


Glu 


Val 


Ser 


Gin 


Asn 


He 


Glu 


Lys 


Leu 


72 








100 










105 










110 






73 


Arg 


Val 


Glu 


Thr 


Gin 


Lys 


Phe 


Glu 


Ala 


Trp 


Leu 


Ala 


Glu 


Val 


Glu 


Gly 


74 






115 










120 










125 








75 


Arg 


Leu 


Pro 


Ala 


Arg 


Asn 


Glu 


Gin 


Ala 


Arg 


Arg 


Gin 


Ser 


Gly 


Leu 


Tyr 
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Ser 


Gin 
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Pro 


Pro 


Thr 


Val 
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Ala 
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Asp 


Arg 


Glu 
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150 
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Pro 


Asp 


Gly 


Ser 


Tyr 
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Glu 


Glu 
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Ser 


Gin 


Glu 


Ser 


Glu 


Met 
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Asp 


Glu 
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Asp 


Asp 


Glu 


Glu 


Pro 
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83 


Leu 


Pro 


Ala 


He 


Gly 
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Phe 


Glu 


Gly 


Gin 


84 
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Glu 
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He 
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Val 


Val 


Glu 


Gly 


Glu 


Thr 
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Tyr 


Val 


He 


86 
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215 
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87 


Glu 


Glu 


Asp 


Lys 


Gly 


Asp 
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Trp 
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Arg 


He 


Arg 


Arg 
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Glu 


Asp 


88 


225 










230 
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Glu 
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Val 
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Val 
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Thr 


Tyr 


He 
















92 








260 










265 

















94 (2) INFORMATION FOR SEQ ID NO : 2: 

96 (i) SEQUENCE CHARACTERISTICS: 

97 (A) LENGTH: 1459 base pairs 

98 (B) TYPE: nucleic acid 

99 (C) STRANDEDNESS : single 

100 (D) TOPOLOGY: linear 

102 (vii) IMMEDIATE SOURCE: 

103 (A) LIBRARY: BRAITUT03 

104 (B) CLONE: 865744 



106 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 








108 


AGTAAAAGCA 


GCCGAATCAA 


TTGATCAGAA 


AAATGATTCA 


CAGCTGGTAA 


TAGAAGCTTA 


60 


109 


TAAATCAGGG 


TTTGAGCCTC 


CTGGAGACAT 


TGAATTTGAG 


GATTACACTC 


AGCCAATGAA 


120 


110 


GCGCACTGTG 


TCAGATAACA 


GCCTTTCAAA 


TTCCAGAGGA 


GAAGGCAAAC 


CAGACCTCAA 


180 


111 


ATTTGGTGGC 


AAATCCAAAG 


GAAAGTTATG 


GCCGTTCATC 


AAAAAAAATA 


AGGGTGCAAC 


240 


112 


ACCGGAGGAT 


TTCAGCAACC 


TCCCACCTGA 


ACAAAGAAGG 


AAAAAGCTGC 


AGCAGAAAGT 


300 


113 


CGATGAGTTA 


AATAAAGAAA 


TTCAGAAGGA 


GATGGATCAA 


AGAGATGCCA 


TAACAAAAAT 


360 


114 


GAAAGATGTC 


TACCTAAAGA 


ATCCTCAGAT 


GGGAGACCCA 


GCCAGTTTGG 


ATCACAAATT 


420 


115 


AGCAGAAGTC 


AGCCAAAATA 


TAGAGAAACT 


GCGAGTAGAG 


ACCCAGAAAT 


TTGAGGCCTG 


480 


116 


GCTGGCTGAG 


GTTGAAGGCC 


GGCTCCCAGC 


ACGCAACGAG 


CAGGCGCGCC 


GGCAGAGCGG 


540 


117 


ACTGTACGAC 


AGCCAGAACC 


CACCCACAGT 


CAACAACTGC 


GCCCAGGACC 


GTGAGAGCCC 


600 
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118 AGATGGCAGT TACACAGAGG AGCAGAGTCA GGAGAGTGAG ATGAAGGTGC TGGCCACGGA 660 

119 TTTTGACGAC GAGTTTGATG ATGAGGAGCC CCTCCCTGCC ATAGGGACGT GCAAAGCTCT 720 

120 CTACACATTT GAAGGTCAGA ATGAAGGAAC GATTTCCGTA GTTGAAGGAG AAACATTGTA 780 

121 TGTCATAGAG GAAGACAAAG GCGATGGCTG GACCCGCATT CGGAGAAATG AAGATGAAGA 84 0 

122 GGGTTATGTC CCCACTTCAT ATGTCGAAGT CTGTTTGGAC AAAAATGCCA AAGGTGCTAA 900 

123 GACTTATATT TAATACCATA AAAAAAAAAA ACTTAAAAAA AATGGAGTTG TTTCTCCCCA 960 

124 CAACCGTGAC TGTTACAGGC AGTTCCTCAA GAGACTGGCT GGCAAGCACC ATAATGCACG 1020 
12 5 TTCTCCTGTA GTCTCACGTG GACTTCAGGG TCCGGGCACC TGAATTGCCT TGTCTAGTTT 1080 

126 GGGCTGTAAT CAAGTTTCAC TTGCTGATGA AATTTTATGT GGAAAGCTGC CAACCGCCAA 1140 

127 CTTACAGCTA TGTCATTCAA AATCTGATAA ACATTTCTTC TTTTGGCGGT ATCTGTAGAT 1200 
12 8 TAAAAAAAAA GTTGCATTGT AGCTTCTCAT CTTTCTGAAT TTAAAAGCCG GCACGCATCA 1260 
12 9 TGCAGGTGCC AAAGACTTCC CTACTCTTGT TTATATCTAG TATCCACCAT ACACTGAGCT 1320 

130 ACATTAGGTG GTTACAGATT GTAACTTAAT AAACTGAACT GTGTTAGTTT GTTAAATTGG 1380 

131 ATACTCATTC ACTTGGGGAG GAGTCACAAG TGAAATACCA TCTCTTTCTT GACTAAAGCG 1440 

132 GTAAATAAGG TTCTTATTG 14 59 
134 (2) INFORMATION FOR SEQ ID NO: 3: 

136 (i) SEQUENCE CHARACTERISTICS: 

137 (A) LENGTH: 175 amino acids 

138 (B) TYPE: amino acid 

139 (C) STRANDEDNESS: single 
14 0 (D) TOPOLOGY: linear 

14 2 (vii) IMMEDIATE SOURCE: 

143 (A) LIBRARY: PROSNOT20 

144 (B) CLONE: 1816529 

14 6 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

14 8 Met Lys Asp Val Tyr Glu Lys Thr Pro Gin Met Gly Asp Pro Ala Ser 

149 15 10 15 

150 Leu Glu Pro Gin lie Ala Glu Thr Leu Ser Asn lie Glu Arg Leu Lys 

151 20 25 30 

152 Leu Glu Val Gin Lys Tyr Glu Ala Trp Leu Ala Glu Ala Glu Ser Arg 

153 35 40 45 

154 Val Leu Ser Asn Arg Gly Asp Ser Leu Ser Arg His Ala Arg Pro Pro 

155 50 55 60 

W--> 156 Xaa Pro Pro Ala Ser Ala Pro Pro Asp Ser Ser Ser Asn Ser Ala Ser 

157 65 70 75 80 

158 Gin Asp Thr Lys Glu Ser Ser Glu Glu Pro Pro Ser Glu Glu Ser Gin 

159 85 90 95 

16 0 Asp Thr Pro lie Tyr Thr Glu Phe Asp Glu Asp Phe Glu Glu Glu Pro 

161 100 105 110 

162 Thr Ser Pro lie Gly His Cys Val Ala lie Tyr His Phe Glu Gly Ser 

163 115 120 125 

164 Ser Glu Gly Thr lie Ser Met Ala Glu Gly Glu Asp Leu Ser Leu Met 

165 130 135 140 

166 Glu Glu Asp Lys Gly Asp Gly Trp Thr Arg Val Arg Arg Lys Glu Gly 

167 145 150 155 160 

168 Gly Glu Gly Tyr Val Pro Thr Ser Tyr Leu Arg Val Thr Leu Asn 

169 165 170 175 
171 (2) INFORMATION FOR SEQ ID NO: 4: 

173 (i) SEQUENCE CHARACTERISTICS: 
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174 
175 
176 
177 
179 
180 
181 
183 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: PROSNOT20 

(B) CLONE: 1816529 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4: 



(A) LENGTH: 773 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



185 ATGAACCGTG CACCCTNCGA CAGCAGTCTG GGCACCCCCT ACGGATGGAC GGNCTGAACT 

186 CCGAGGNCCG GGTCGCAGCC GCACCAAGCG CTGGNCTTTT GGCAAGAAGA ACAAGACAGT 

187 GGTGACCGAG GATTTTAGCC ACTTGCCCCC AGAGCAGCAG CGAAAACGGC TTCAACAGCA 

188 GTTGGAAGAA CGCAGTCGTG AACTTCAGAA GGAGGTTGAC CAGAGGGAAG CCCTAAAGAA 

189 AATGAAGGAT GTCTATGAGA AGACACCTCA GATGGGGGAC CCCGCCAGCT TGGAGCCCCA 

190 GATCGCTGAA ACCCTGAGCA ACATTGAACG GCTGAAATTG GAAGTGCAGA AGTATGAGGC 

191 GTGGCTGGCA GAAGCTGAAA GTCGAGTCCT TAGCAACCGG GGAGACAGCC TGAGCCGGCA 

192 CGCCCGGCCT CCCGANCCCC CCGCTAGCGC CCCGCCAGAC AGCAGCAGCA ACAGCGCATC 

193 ACAGGACACC AAGGAGAGCT CTGAAGAGCC TCCCTCAGAA GAGAGCCAGG ACACCCCCAT 

194 TTACACGGAG TTTGATGAGG ATTTCGAGGA GGAACCCACA TCCCCCATAG GTCACTGTGT 

195 GGCCATCTAC CACTTTGAAG GGTCCAGCGA GGGCACTATC TCTATGGCCG AGGGTGAAGA 

196 CCTCAGTCTT ATGGAAGAAG ACAAAGGGGA CGGCTGGACC CGGGTCAGGC GGAAAGAGGG 

197 AGGCGAGGGC TACGTGCCCA CCTCCTACCT CCGAGTCACG CTCAATTGAA CCC 
199 (2) INFORMATION FOR SEQ ID NO : 5: 

201 (i) SEQUENCE CHARACTERISTICS: 

202 (A) LENGTH: 237 amino acids 

203 (B) TYPE: amino acid 

204 (C) STRANDEDNESS: single 

205 (D) TOPOLOGY: linear 

207 (vii) IMMEDIATE SOURCE: 

208 (A) LIBRARY: GenBank 

209 (B) CLONE: 1255033 

211 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 

213 Lys lie His Cys Phe Arg Ser Leu Lys Arg Gly Gly Val Thr Pro Glu 

214 15 10 15 

215 Asp Phe Ser Asn Phe Pro Pro Glu Gin Arg Arg Lys Lys Leu Gin Gin 

216 20 25 30 

217 Lys Val Asp Asp Leu Asn Arg Glu lie Gin Lys Glu Thr Asp Gin Arg 

218 35 40 45 

219 Asp Ala lie Thr Lys Met Lys Asp Val Tyr Leu Lys Asn Pro Gin Met 

220 50 55 60 

221 Gly Asp Pro Ala Ser Leu Asp Gin Lys Leu Thr Glu Val Thr Gin Asn 

222 65 70 75 80 

223 lie Glu Lys Leu Arg Leu Glu Ala Gin Lys Phe Glu Ala Trp Leu Ala 

224 85 90 95 

225 Glu Val Glu Gly Arg Leu Pro Ala Arg Ser Glu Gin Ala Arg Arg Gin 

226 100 105 110 

22 7 Ser Gly Leu Tyr Asp Gly Gin Thr His Gin Thr Val Thr Asn Cys Ala 

228 115 120 125 

229 Gin Asp Arg Glu Ser Pro Asp Gly Ser Tyr Thr Glu Glu Gin Ser Gin 

230 130 135 140 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
773 
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Glu 
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He 
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Tyr 


Thr 
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Phe 


Glu 


Gly Gin 


Asn 


Glu 


Gly 


Thr 


He Ser 


Val 


Val 


Glu 


Gly 


Glu 


Thr 


236 






180 










185 








190 






237 


Leu 


Ser 


Val He 


Glu 


Glu 


Asp 


Lys 


Gly Asp 


Gly 


Trp 


Thr 


Arg 


He 


Arg 


238 






195 








200 








205 








239 


Arg 


Asn 


Glu Asp 


Glu 


Glu 


Gly 


Tyr 


Phe Pro 


Thr 


Ser 


Tyr 


Val 


Glu 


Val 


240 




210 








215 








220 










241 


Tyr 


Leu 


Asp Lys 


Asn 


Ala 


Lys 


Gly Ala Lys 


Thr 


Tyr 


He 








242 


225 








230 








235 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/925,122 



DATE: 11/13/2001 
TIME: 14:41:54 



Input Set : N:\Crf3\RULE60\09925122.txt 
Output Set: N:\CRF3\11132001\I925122.raw 



L:3 M:220 C: Keyword misspelled or invalid format, [(1) GENERAL INFORMATION : ] 
L:9 M:220 C: Keyword misspelled or invalid format, [(ii) TITLE OF INVENTION:] 
L:29 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER : ] 
L:30 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 
L:156 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 
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